Value-Based Planning for Teams of Agents in Stochastic Partially Observable Environments
نویسندگان
چکیده
منابع مشابه
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems
This article describes the Multiagent Decision Process (MADP) Toolbox, a software library to support planning and learning for intelligent agents and multiagent systems in uncertain environments. Key features are that it supports partially observable environments and stochastic transition models; has unified support for singleand multiagent systems; provides a large number of models for decisio...
متن کاملCase-Based Behavior Recognition to Facilitate Planning in Unmanned Air Vehicles
An unmanned air vehicle (UAV) can operate as a capable team member in mixed human-robot teams if the agent that controls it can intelligently plan. However, planning effectively in an air combat scenario requires understanding the behaviors of hostile agents in that scenario, which is challenging in partially observable environments such as the one we study. We present a Case-Based Behavior Rec...
متن کاملCoordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach
Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative for the agents to be able to reason about the rewards (and costs) for their actions in the presence of uncertainty. However, finding the optimal distributed POMDP policy is computationally intractable (NEXPComplete). T...
متن کاملEnabling Supportive Communications in Decentralized Multi-Agent Teams
Supportive communication is an effective collaboration behavior identified in human teams in which team members share information proactively to improve overall team performance. Prior work formulated this objective as the Single-Agent in a Team Decision Problem (SAT-DP) where agents decide whether or not to communicate an unexpected observation during execution time. We extend the SAT-DP defin...
متن کاملA Framework for Optimal Sequential Planning in Multiagent Settings
Introduction Research in autonomous agent planning is gradually moving from single-agent environments to those populated by multiple agents. In single-agent sequential environments, partially observable Markov decision processes (POMDPs) provide a principled approach for planning under uncertainty. They improve on classical planning by not only modeling the inherent non-determinism of the probl...
متن کامل